skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Creators/Authors contains: "Wan, J"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Host-managed shingled magnetic recording drives (HMSMR) give a capacity advantage to harness the explosive growth of data. Applications where data is sequentially written and randomly read, such as key-value stores based on Log-Structured Merge Trees (LSM-trees), make the HMSMR an ideal solution due to its capacity, predictable performance, and economical cost. However, building an LSMtree based KV store on HM-SMR drives presents severe challenges in maintaining the performance and space efficiency due to the redundant cleaning processes for applications and storage devices (i.e., compaction and garbage collections). To eliminate the overhead of on-disk garbage collections (GC) and improve compaction efficiency, this paper presents GearDB, a GC-free KV store tailored for HMSMR drives. GearDB proposes three new techniques: a new on-disk data layout, compaction windows, and a novel gear compaction algorithm. We implement and evaluate GearDB with LevelDB on a real HM-SMR drive. Our extensive experiments have shown that GearDB achieves both good performance and space efficiency, i.e., on average 1:71 faster than LevelDB in random write with a space efficiency of 89.9%. 
    more » « less
  2. Key-value (KV) stores play an increasingly critical role in supporting diverse large-scale applications in modern data centers hosting terabytes of KV items which even might reside on a single server due to virtualization purpose. The combination of ever growing volume of KV items and storage/application consolidation is driving a trend of high storage density for KV stores. Shingled Magnetic Recording (SMR) represents a promising technology for increasing disk capacity, but it comes at a cost of poor random write performance and severe I/O amplification. Applications/software working with SMR devices need to be designed and optimized in an SMR-friendly manner. In this work, we present SEALDB, a Log-Structured Merge tree (LSM-tree) based key-value store that is specifically op- timized for and works well with SMR drives via adequately addressing the poor random writes and severe I/O amplification issues. First, for LSM-trees, SEALDB concatenates SSTables of each compaction, and groups them into sets. Taking sets as the basic unit for compactions, SEALDB improves compaction efficiency by mitigating random I/Os. Second, SEALDB creates varying size bands on HM-SMR drives, named dynamic bands. Dynamic bands not only accommodate the storage of sets, but also eliminate the auxiliary write amplification from SMR drives. We demonstrate the advantages of SEALDB via extensive experiments in various workloads. Overall, SEALDB delivers impressive performance improvement. Compared with LevelDB, SEALDB is 3.42× faster on random load due to improved compaction efficiency and eliminated auxiliary write amplification on SMR drives. 
    more » « less
  3. NAND flash-based Solid State Devices (SSDs) offer the desirable features of high performance, energy efficiency, and fast growing capacity. Thus, the use of SSDs is increasing in distributed storage systems. A key obstacle in this context is that the natural unbalance in distributed I/O workloads can result in wear imbalance across the SSDs in a distributed setting. This, in turn can have significant impact on the reliability, performance, and lifetime of the storage deployment. Extant load balancers for storage systems do not consider SSD wear imbalance when placing data, as the main design goal of such balancers is to extract higher performance. Consequently, data migration is the only common technique for tackling wear imbalance, where existing data is moved from highly loaded servers to the least loaded ones. In this paper, we explore an innovative holistic approach, Chameleon, that employs data redundancy techniques such as replication and erasure-coding, coupled with endurance-aware write offloading, to mitigate wear level imbalance in distributed SSD-based storage. Chameleon aims to balance the wear among different flash servers while meeting desirable objectives of: extending life of flash servers; improving I/O performance; and avoiding bottlenecks. Evaluation with a 50 node SSD cluster shows that Chameleon reduces the wear distribution deviation by 81% while improving the write performance by up to 33%. 
    more » « less
  4. The ALICE Collaboration reports measurements of the large relative transverse momentum ( k T ) component of jet substructure in p p and Pb-Pb collisions at center-of-mass energy per nucleon pair s NN = 5.02 TeV . Enhancement in the yield of such large- k T emissions in head-on Pb-Pb collisions is predicted to arise from partonic scattering with quasiparticles of the quark-gluon plasma. The analysis utilizes charged-particle jets reconstructed by the anti- k T algorithm with resolution parameter R = 0.2 in the transverse-momentum interval 60 < p T , ch , jet < 80 GeV / c . The soft drop and dynamical grooming algorithms are used to identify high transverse momentum splittings in the jet shower. Comparison of measurements in Pb-Pb and p p collisions shows medium-induced narrowing, corresponding to yield suppression of high- k T splittings, in contrast to the expectation of yield enhancement due to quasiparticle scattering. The measurements are compared to theoretical model calculations incorporating jet modification due to jet-medium interactions (“jet quenching”), both with and without quasiparticle scattering effects. These measurements provide new insight into the underlying mechanisms and theoretical modeling of jet quenching. 
    more » « less
  5. Abstract Event-by-event fluctuations of the event-wise mean transverse momentum,$$\langle p_{\textrm{T}}\rangle $$ p T , of charged particles produced in proton–proton (pp) collisions at$$\sqrt{s}$$ s = 5.02 TeV, Xe–Xe collisions at$$\sqrt{s_{\textrm{NN}}}$$ s NN = 5.44 TeV, and Pb–Pb collisions at$$\sqrt{s_{\textrm{NN}}}$$ s NN = 5.02 TeV are studied using the ALICE detector based on the integral correlator$$\langle \!\langle \Delta p_\textrm{T}\Delta p_\textrm{T}\rangle \!\rangle $$ Δ p T Δ p T . The correlator strength is found to decrease monotonically with increasing produced charged-particle multiplicity measured at midrapidity in all three systems. In Xe–Xe and Pb–Pb collisions, the multiplicity dependence of the correlator deviates significantly from a simple power-law scaling as well as from the predictions of the HIJING and AMPT models. The observed deviation from power-law scaling is expected from transverse radial flow in semicentral to central Xe–Xe and Pb–Pb collisions. In pp collisions, the correlation strength is also studied by classifying the events based on the transverse spherocity,$$S_0$$ S 0 , of the particle production at midrapidity, used as a proxy for the presence of a pronounced back-to-back jet topology. Low-spherocity (jetty) events feature a larger correlation strength than those with high spherocity (isotropic). The strength and multiplicity dependence of jetty and isotropic events are well reproduced by calculations with the PYTHIA 8 and EPOS LHC models. 
    more » « less